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SCALABLE VIDEO TRANSCODING 



TECHNICAL FIELD 
The described subject matter relates to video data encoding. More 
particularly, the subject matter relates to video transcoders for adjusting the bit rate 
of encoded video data. 

BACKGROUND 

Efficient and reliable delivery of video data is becoming increasingly 
important as the Intemet continues to grow in popularity. Video is very appealing 
because it offers a much richer user experience than static images and text. It is 
more interesting, for example, to watch a video clip of a winning touchdown or a 
Presidential speech than it is to read about the event in stark print. Unfortunately, 
video data can require significantly more memory, processor usage, and bandwidth 
than other data types commonly delivered over the Intemet. As an example, one 
second of uncompressed video data may consume one or more Megabytes of data. 
Delivering such large amounts of data over error-prone networks, such as the 
Intemet and wireless networks, presents difficult challenges in terms of efficiency, 
reliability, and network capacity. 

After video is encoded in according to a video coding format, the encoded 
video may be transmitted to a client that employs a decoder to decode and present 
the video. A decoder can decode the sequence of firames by performing 
operations that are substantially the inverse of the encoding operations. Prior 
to video transmission, it may be desirable to reduce the bit rate of the encoded 
video depending on various conditions, such as available ^network bandwidth, 
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channel capacity, CPU utilization, and client buffer state. By reducing the bit rate 
in response to changes in such conditions, the encoder can facilitate higher 
transmission performance and/or higher quality video presentation at the client. 

One approach to reducing the bit rate at the encoder is to employ a cascaded 
decoder-encoder. Fig. 2 illustrates the cascaded approach. As shown, a decoder 
202 is placed prior to an encoder 204, whereby the decoder fully decodes video 
input 206 prior to the encoder 204 re-encoding the video at the desired bit rate. 
While the cascaded approach may achieve the desired bit rate adjustment without 
reduction of video quality or drift error, the complexity of the system (e.g., a full 
decoder) is a serious drawback. 

SUMMARY 

Implementations of systems, methods, data structures, and computer 
program products described herein address the above, and other, problems and 
drawbacks by providing a scalable video transcoder that encodes video data frames 
at a desired bit rate. The transcoder maintains the underlying format of the input 
video, but merely changes the bit rate of the output video. The video transcoder 
can be scaled in complexity to achieve associated encoding goals including bit rate 
adjustment by selectively accumulating drift error, selectively compensating with 
drift error, skipping selected frames, and/or entering/exiting open-loop mode. 

An implementation of a method for transcoding input video data includes 
accumulating transcoding error associated with transcoding the input video into 
output video data encoded at a target bit rate, motion-compensating the 
accumulated transcoding error, error-compensating the input video data with 
motion-compensated accumulated transcoding error, and selectively disabling one 
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or more of the accumulating and error-compensating operations based on one or 
more conditions related to transcoding the input video data. 

An exemplary system for transcoding input video data includes an 
accumulating module accumulating transcoding error associated with transcoding 
the input video into the output video data, a motion compensation module 
compensating the accumulated transcoding error with motion estimation data, an 
error-compensating module compensating the input video data with motion- 
compensated accumulated transcoding error, a re-quantization module controlling 
a re-quantization level to achieve the target bit rate, and a compensation switching 
module operable to disable the error-compensating module in response to the 
motion-compensated accumulated transcoding error being less than a threshold 
value. 

Another implementation of a system for transcoding video data includes 
input video data encoded at an initial bit rate and means for transcoding the input 
video data to generate corresponding output video data encoded at a target bit rate, 
wherein the means for transcoding includes a re-quantization module having a 
look-up table including a plurality of dynamically selectable quantization 
parameters corresponding to associated bit rates. 

An implementation of a computer-readable medium includes computer- 
executable instructions for performing a method including accumulating 
transcoding error associated with transcoding input video having an associated 
initial bit rate into output video data having an associated target bit rate, motion- 
compensating accumulated transcoding error, error-compensating the input video 
data with motion-compensated accumulated transcoding error, and selectively 
disabling one or more of the accumulating and the error-compensating operations 
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in response to detecting one or more conditions related to transcoding the input 
video data. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 is a block diagram of a video distribution system in which a content 
producer/provider transcodes video data and transfers the transcoded video data 
over a network to a client. 

Fig. 2 is a block diagram of a cascaded decoder/encoder arrangement that 
can transcode input video data at a selected bit rate. 

Fig. 3 is a block diagram of an exemplary video transcoder that can be used 
for encoding input video data at a target bit rate. 

Fig. 4 is a block diagram of another exemplary video transcoder that can be 
used for encoding I-frames of input video data at a target bit rate. 

Fig. 5 is a block diagram of another exemplary video transcoder that can be 
used for encoding B-frames of input video data at a target bit rate. 

Fig. 6 is a block diagram of an exemplary re-quantization module that 
employs a look-up table for rapid re-quantization of input video data. 

Fig. 7 illustrates an intra-quantization matrix and an inter-quantization 
matrix that can serve as exemplary implementations of the look-up table of Fig. 6. 

Fig. 8 is a block diagram of a target bit rate controller that may be used to 
dynamically control quantization levels to achieve a target bit rate of transcoded 
video data. 
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Fig. .9 illustrates an operation flow including exemplary operations for 
transcoding input video data at an original bit rate to output video data at a target 
bit rate. 

Fig. 10 illustrates an operation flow including exemplary operations for 
scaling complexity of a transcoder in response to system conditions and/or type of 
video frame. 

DETAILED DESCRIPTION 

This disclosure describes a video transcoder and associated operations that 
may be used in motion-compensation-based video coding systems to adjust the 
encoding of video data based on one or more system conditions or to achieve 
selected encoding objectives. The transcoder is described in the context of 
delivering video data over a network, such as the Intemet or a wireless network. 
However, the transcoder and associated operations have general applicability to a 
wide variety of environments. 

Various data transmission conditions, such as bandwidth fluctuation, CPU 
utilization, and client buffer state, can cause major problems encountered in 
transmitting video data over the Intemet or wireless channels. Adjusting the bit 
rate of transmitted data to adapt to such conditions without imposing substantial 
complexity on the encoder can also pose a challenge. The video transcoding 
scheme described below can facilitate adjustment of video bit rate in response to 
changes in these conditions with a scalable transcoder, while providing high 
coding efficiency and performance. 

Exemplary System Architecture 
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Fig. 1 shows a video distribution system 100 in which a content 
producer/provider 102 produces and/or distributes video over a network 104 to a 
client 106. The network 104 may represent of many different types of networks, 
including the Internet, a LAN (local area network), a WAN (wide area network), a 
SAN (storage area network), and wireless networks (e.g., satellite, cellular, RF, 
etc.). 

The content producer/provider 102 may be implemented in many ways, 
including as one or more server computers configured to store, process, and/or 
distribute video data. The content producer/provider 102 has a video storage 108 
to store digital video files 110 and a distribution server 112 to encode and/or 
transcode the video data and distribute it over the network 104. The video data 
files 110 are stored in an encoded format, such as the Motion Picture Experts 
Group-2 (MPEG-2) format. 

To assist in illustrating various concepts discussed herein, the MPEG-2 
format is briefly described. MPEG-2 video is composed of groups of pictures 
(GOPs). Each GOP includes a sequence of three types of video frames: intra 
coded frames (I-frames), predicted frames (P-frames), and bidirectional frames (B- 
frames). I-frames use discrete cosine transform (DCT) encoding to compress a 
single frame without reference to any other frame in the sequence. P-frames are 
encoded as differences from the last I or P-frame and use motion prediction to 
predict values of each new pixel. B-frames are encoded as differences from the 
last and/or next I or P frame, and also employ motion prediction. 

An MPEG-2 frame is composed of macroblocks. A macroblock is a 
fundamental unit of motion compensation. Each macroblock consists of a 16 x 
16 array of pixels and has associated motion vector(s). A motion vector 
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represents the horizontal and vertical displacement from the macroblock being 
encoded to the matching macroblock-sized area in the reference picture. MPEG-2 
is therefore referred to as a motion-compensation-based video coding scheme. 
Other motion-compensation-based video coding formats include MPEG-1, MPEG- 
4, H.261,H.263and H.264. 

With regard to providing MPEG-2 video data, the distribution server 112 
employs a central processing unit (CPU) 114, an operating system 116 (e.g., 
Windows NT, Unix, etc.), random access memory (RAM) 118, and a video 
transcoder 120. The distribution server 112 responds to requests for video data 
110, or portions thereof, by transcoding and transmitting the requested video data 
110 over the network. Typically, the video data 1 10 is transcoded and transmitted 
on a frame-by-frame basis. 

The video transcoder 120 encodes the video data 110 according to the video 
data 110 format at a selected, or target, bit rate prior to transmitting video data over 
the network 104. The target bit rate may differ from the bit rate of the original 
video data 110. The target bit rate may be set in response to one or more video 
transmission conditions, such as, but not limited to, network 104 bandwidth, 
channel capacity on the network 104, and CPU 114 utilization. 

Thus, the transcoder 120 performs a selected video format-to-selected video 
format transcoding of the video data 110 wherein the bit rate may be changed 
during transcoding. In an implementation using MPEG-2 format video data 110, 
the transcoder 120 receives MPEG-2 video data 110 at an original bit rate and 
generates MPEG-2 video data at a target bit rate. Other video formats, such as, 
but not limited to, MPEG-4, H.261, H.263, H.264, and 'WMV' (WINDOWS 



Lee & Hayes, PLLC 



7 



MSI-1695VS 
305876.01 



1 

2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 
25 



Media Format) may be applied to the transcoder 120 and transmitted at an adjusted 
bit rate. 

In addition to encoding video data at a target bit rate, the transcoder 120 
may drop (i.e., not transmit) certain selected portions of the video data 110. For 
example, the video transcoder 120 may not transmit B-frames if the CPU 114 
utilization is too high. By dropping a portion of the video data 110, the quality of 
the video at the client 106 may be reduced slightly, but this may be more desirable 
than the consequences of failing entirely to encode the video data 110 rapidly 
enough. 

To facilitate adaptation to transmission conditions, the transcoder 120 may 
be scalable in complexity. For example, various portions of the encoding process 
of the transcoder 120 may be dynamically included in and/or excluded from the 
transcoder 120 to achieve desired encoding goals. 

The video transcoder 120 may be implemented in software, firmware, 
and/or hardware. The transcoder 120 is shown as a separate standalone module 
for discussion purposes, but may be constructed as part of the CPU 114 or 
incorporated into operating system 116 or other applications (not shown). 
Various exemplary implementations of the transcoder 120 and associated 
processes and operations are illustrated and described in fiirther detail below with 
regard to Figs. 3-10. 

The client 106 is equipped with a CPU 128, a memory 130, and one or more 
media output devices 132. The memory 130 stores an operating system 134 (e.g., 
a WINDOWS-brand operating system) that executes on the CPU 128. The 
operating system 134 implements a client-side video decoder 136 to decode the 
layered video streams into the original video. In the event data is lost, the 
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decoder 136 is capable of reconstructing the missing portions of the video from 
frames that are successfully transferred. Following decoding of each video 
frame, the client 106 plays the video frame via the media output devices 132. 
The client 106 may be embodied in many different ways, including a computer, a 
handheld entertainment device, a set-top box, a television, and so forth. 

Exemplary systems for encoding video data at a target bit rate 

Fig. 2 is a block diagram of a cascaded decoder/encoder arrangement 200 
that can transcode MPEG-2 input video data at a selected bit rate. The cascaded 
decoder/encoder 200 includes a video decoder 202 and a video encoder 204. The 
decoder 202 receives MPEG-2 input video data 206 encoded according to the 
MPEG-2 video format at an initial bit rate, and fully decodes the input video data 
206. 

Decoding the input video data 206 involves first applying a variable length 
decoder (VLD) 208. The VLD 208 extracts certain information from the input 
MPEG-2 video including motion vectors 210 (shown in dotted lines) and headers 
212 (shown in dotted lines). The VLD 208 outputs decoded video coefficients to 
a dequantization module (or inverse quantizer) 214. 

The dequantization module 214 dequantizes the decoded coefficients by 
applying a dequantization function, , which uses a quantization parameter, qi, 
that corresponds to the initial bit rate of the input video data 206. The 
dequantization module 214 multiplies video data coefficients by corresponding 
values in a quantization matrix and a quantization scale factor. The dequantized 
video data is then input into an inverse discrete cosine transform (IDCT) module 
216, which decompresses the video data according to an IDCT algorithm. Any 
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methods of variable length decoding, video dequantizing and decompressing as 
may be known in the art can be applied by the VLC 208, the dequantization 
module 214 and IDCT module 216, respectively. 

The output of the IDCT module 216 is combined with motion compensated 
video from a previously decoded macroblock or frame. The combination of the 
IDCT module 216 and the motion compensated video are the decoded video data 
218 that are output from the decoder 202. The decoded video data 218 are fed 
back into a frame buffer 220, which stores the decoded video data 218. The 
frame buffer 220 outputs the decoded video data 218 to a motion compensation 
module 222. The motion compensation module 222 uses the motion vectors 210 
and the decoded video data 218 to generate motion compensated macroblocks for 
the following frame. 

The decoded video data 218 is input into the encoder 204. The encoder 
204 fully re-encodes the decoded video data 218 by recompressing, re-quantizing, 
and variable length coding the decoded video data 218. As shown, the encoder 
204 includes a difference module 224 that outputs the difference between the 
decoded video data 218 and motion compensated transcoded video data 226. 

The output of the difference module 224 is input into a discrete cosine 
transform (DCT) module 228, which compresses the data using a DCT algorithm. 
The DCT module 228 outputs DCT coefficients to a quantization module 230 that 
quantizes the DCT coefficients with a quantization fiinction, Q2, which applies a 
target quantization parameter, q2, that corresponds to the target bit rate. 
Quantization refers to assigning a number of bits to represent a unit of video data, 
such as a pixel, DCT coefficient, or video frame. More quantization levels 
available for assigning video values typically corresponds to higher bit rates. The 
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quantization module 230 outputs quantized DCT coefficients at the desired 
quantization levels to a variable length coder (VLC) 232. The VLC 232 performs 
a variable length coding algorithm on the re-quantized data to generate transcoded 
output video 234 at a desired bit rate. 

The quantized DCT coefficients from the quantization module 230 are also 
dequantized by dequantization module 236. The dequantization module 236 
applies an inverse quantization fiinction, , using the target quantization 
parameter, q2, to generate dequantized DCT coefficients. The dequantized DCT 
coefficients are input to an IDCT module 238 that performs an inverse DCT on the 
dequantized DCT coefficients. The output of the IDCT 238 is combined with the 
motion compensated transcoded video data 226 and then stored in a frame buffer 
240. The output of the fi'ame buffer 240 is input to a motion compensator module 
242, which uses the motion vectors 210 to generate the motion compensated 
transcoded video data 226. 

Although the cascaded decoder/encoder approach described in Fig. 2 yields 
acceptable transcoding results with little or no drift error, the cascaded 
arrangement 200 is highly complex. In addition, the cascaded arrangement 200 is 
highly CPU intensive, requiring continuous feedback, dequantizing, 
decompressing, and motion estimation. If the CPU is relatively low-speed, or 
low-power, the CPU may not be able to maintain the required rate of transcoding. 

Figs. 3-5 illustrate exemplary altematives to the highly complex cascaded 
arrangement 200. The exemplary altematives shown in Figs. 3-5 can facilitate 
fast, high-quality, adaptable transcoding with limited or no drift error, while 
adjusting to system conditions, such as CPU utilization. 
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Fig. 3 is a block diagram of an exemplary video transcoder 300 that can be 
used for transcoding input video data 302 into output video data 304 at a target bit 
rate. As discussed, the transcoder 300 is typically implemented on server computer, 
or other type of suitable computing device. The input video data 302 is assumed to 
be encoded at an initial bit rate which is easily determined by the transcoder 300 or 
an application running on the server. 

The output video data 304 is also referred to as transcoded video data 304. 
The transcoded video data 304 does not differ from the input video data 302 in 
terms of video encoding format. Thus, for example, if the input video data 302 is 
MPEG-2 video, the transcoded video data 304 will be represented in the MPEG-2 
format. The transcoded video data 304 may differ from the input video data 302 in 
terms of bit rate. The transcoded video data 304 may also differ from the input 
video data 302 in terms of frame composition. As discussed in further detail 
below, the transcoder 300 may selectively skip, or drop, certain frames. 

A visual comparison of Fig. 2 and Fig. 3 shows that Fig. 3 includes modules 
that are analogous to some of the modules in Fig. 2, but that Fig. 3 is substantially 
scaled down from the cascaded transcoder of Fig. 2. For example, although the 
transcoder 300 includes a variable coder and decoder as in the cascaded 
arrangement, the transcoder 300 does not include an IDCT module analogous to 
the IDCT module 216 or a DCT module analogous to the DCT module 228. 

One reason the transcoder 300 of Fig. 3 can be so greatly simplified from 
that of Fig. 2 is that the transcoder of Fig. 3 only transcodes the bit rate and/or 
selectively skips video frames from those of the input video data 304, rather than 
changing the actual format or spatial resolution of the input video data 304, which 
are characteristic of some traditional transcoders. The reduction in complexity 
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offered by the transcoder 300 can be beneficial in some situations. In addition, 
the transcoder 300 includes switching modules (described below) that provide for 
complexity scalability. 

With specific regard to the transcoder 300, a variable length decoder (VLD) 
306 receives the input video data 302 and applies variable length decoding to the 
input data. The variable length decoder 306 extracts header information 308 
(shown with dotted lines) and motion vector information 310 (shown with dotted 
lines) from the input video data 302. The header information 308 and the motion 
vector information 310 are fed forward in the transcoder 300 for use by other 
processes/devices in the transcoder 300. 

The output of the VLD 306 is input to a dequantization module 312, which 
dequantizes the video data according to a dequantization function, . The 
dequantization function, Q'"" , applies quantization parameter qi, which 
corresponds to an initial bit rate of the input video data 302. As discussed, the 
initial bit rate of the input video data 302 is readily available by the transcoder 300. 
The output of the dequantization module 312 includes dequantized compression 
coefficients 3 14 (e.g., DCT coefficients) representative of the input video data 302. 

As is discussed in further detail below, the dequantized compression 
coefficients 314 are fed forward and used to determine transcoding error 
information later in the transcoding process. The dequantized compression 
coefficients 314 are also summed with transcoding error compensation data 316 at 
an error compensation module 318. The error compensation module 318 outputs 
dequantized compression coefficients that have been adjusted or compensated with 
the transcoding error compensation data 316. The transcoding error 
compensation data 316 is also referred to herein as drift error, accumulated drift 

Ue & Hayes, PLLC 1 3 MSI-1695US 

305876.01 



1 

2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 
25 



error, accumulated transcoding error, or accumulated error. The manner of 
generating the transcoding error compensation 3 16 is discussed further below. 

The output of the error compensation module 318 is input into a 
quantization module 320, which quantizes the error compensated video data 
according to a quantization function, Q2 . The quantization function, Q2 , 
quantizes the error compensated video data based on a target quantization 
parameter, q2, which corresponds to a target bit rate. A rate control input 322 to 
the quantization module 320 controls the quantization parameter, q2 , to achieve 
the target bit rate. The rate control input 322 is discussed in further detail with 
regard to Fig. 8. The output of the quantization module 320 is re-quantized video 
data 324 at the target bit rate. 

The re-quantized video data 324 is also referred to as 'transrated' video data 
324. The term 'transrated' means that the video data that is output by the 
quantization module 320 may have a different bit rate than the initial bit rate of the 
input video data 302. Typically the bit rate of the transrated video data 324 is 
lower than the initial bit rate in order to adapt to certain system conditions, such as 
lower network bandwidth or channel capacity. 

The transrated video data 324 is input to a variable length coder (VLC) 326, 
which variable length encodes the transrated video data 324. The output of the 
VLC 326 is the transcoded video data 304, The transrated video data 324 is input 
to another dequantization module 328, which dequantizes the transrated video data 
324 using a dequantization function, The dequantization function, Q~\ 

applies the inverse of the quantization module 320. Thus, the dequantization 
function, ? dequantizes the transrated video data 324 using the target 
quantization (i.e., dequantization) parameter, q2. 
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The output of the dequantization module 328 is dequantized transrated 
video data 330. The dequantized transrated video data 330 is input into an error 
determination module 332. The error determination module 332 subtracts the 
dequantized transrated video data 330 from the initial dequantized compression 
coefficients 314. The output of the error determination module 332 represents 
transcoder error or drift error. 

An inverse discrete cosine transform (IDCT) module 334 performs an IDCT 
function on the transcoder error data from the error determination module 332. 
The IDCT module 334 converts the transcoder error data from the frequency 
domain to the pixel domain. Thus, the output of the IDCT module 334 represents 
transcoder error data in the pixel domain. The pixel domain transcoder error data 
is summed with motion compensated error data by an accumulator 336. 

The output of the accumulator 336 is stored in an error buffer 338. The 
error buffer 338 includes accumulated transcoder error or accumulated drift error. 
A motion compensator 340 applies the motion vectors 310 to the accumulated 
transcoder error from the error buffer 338. The motion compensated data from 
the motion compensator 340 is fed back into the accumulator 336. The motion 
compensated data is also input into a discrete cosine transform (DCT) module 342. 

The DCT module 342 applies a DCT fiinction to the motion compensated 
error data. The DCT module 342 converts the pixel domain error data back into 
the frequency domain. The output of the DCT module is the transcoding error 
compensation data 316 referred to above. As discussed above, the transcoding 
error compensation data 316 is added to the initial dequantized compression 
coefficients 314 by the summer 318. The summer 318 performs an important 
task of adjusting the input dequantized video data by the accumulated drift error in 
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order to reduce or eliminate error due to drift that may arise during the transcoding 
process. 

The dequantization module 312, the quantization module 320, the 
dequantization module 328, and the error determination module 332, may 
collectively be referred to as a re-quantization module. An exemplary re- 
quantization module is illustrated in Fig. 6 and discussed in fiirther detail below. 
Because the processes of dequantizing and quantizing can be computationally 
demanding, the exemplary re-quantization module in Fig. 6 includes a useful look- 
up table (LUT) that can save processor time. 

As discussed above, the transcoder 300 includes one or more switching 
modules or components that provide for complexity scalability of the transcoder 
300. The switching modules provide for system condition adaptability and video 
frame-type adaptability. Exemplary switching components are described in detail 
below. 

One implementation of the transcoder 300 includes skip B -frame switch(es) 
344, an accumulate error switch 346, and a compensate switch 348. The switches 
344, 346 and 348 provide for selective addition and/or removal of certain portions 
or loops in the transcoder 300. During operation, when system conditions 
change, it may be beneficial to disable and/or enable one or more of the portions of 
the transcoder to achieve desired system objectives using the switches, thereby 
disabling and/or enabling one or more of the error-compensating and error 
accumulating operations in the transcoding process. 

With more specific regard to the compensate switch 348, the compensate 
switch 348 controls whether the dequantized input video data 314 will be 
compensated with the accumulated transcoding (or drift) error 316. The notation 
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TH' represents a threshold value (e.g., Q2/2). If the accumulated drift error 316 is 
more than the threshold, the compensate switch 348 is closed and the accumulated 
drift error 316 is combined with the dequantized input video data 314 at the 
summer module 318; otherwise, error-compensation will not occur and the input 
video data 302 will simply go through a re-quantization process. 

A particular implementation of the compensate switch involves a triple- 
threshold algorithm to control whether or not to update an 8x8 block in a frame 
with the accumulated errors. In this implementation, the threshold of each 8x8 
block is dynamically selected along with the counter of a block that is not updated. 

In the triple-threshold algorithm, an error metric, 5, can be calculated that 
represents the accumulated error of pixels in a video block. In a particular 
implementation of the triple-threshold algorithm, the error metric is a sum of 
absolute error of the accumulated error of each 8x8 block as shown in equation 
(1): 

^=ZZKw)| (1) 

/=o j=o 

where e(i, j) represents the accumulated error of each pixel in a block. In 
other implementations, the error metric can include other fiinctions of the error, 
such as, but not limited to, the mean square error. 

To fiirther illustrate the triple-threshold approach, assume there are three 
thresholds: THl, TH2, and TH3, where THl > TH2 > TH3. The larger the 
threshold, the less likelihood an 8x8 block will be compensated with the 
accumulated error. 
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In order to achieve a better trade-off between transcoding complexity and 
picture quality, the selection of the threshold is described with pseudo-code shown 
below, where every block has a unique counter variable blockjcounter. The larger 
blockjoounter, the less the selected threshold, then the more likelihood that blocks 
are compensated with the accumulated error. The parameter blockjcounter is set 
to zero when the block is an intra block (i.e., an 8x8 block coded without any 
prediction from other frames). 



switch {block_counter) 

{ 

case 0: TH = THl; breaks- 

case 1: TH = TH2; break; 

default: TH = TH3; break; 

} 

for (i = 0;i <= 7;i++) { 

for (j = 0; j <= 7;i++) { 
s += abs (e (i, j ) ) ; 

} 

} 

if (Intra block) 

block_counter = 0; 
else if (Inter block in P picture) 
{ 

if (s > TH) 
{ 

//drifting error compensation; 
if (jbIocic_counter > 0) 

blockjcounter = block_counter - 1; 

} 

else 
{ 

//no drifting error compensation; 
block counter = block counter + 1; 
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When block_counter is 0, i.e., drifting error propagated from previous 
frames is zero or very small, the high threshold THl is selected. When 
blockjoounter is 1, the intermediate threshold TH2 is selected; otherwise the lower 
threshold TH3 is selected. If an inter block (i.e., an 8x8 block that is coded with 
prediction from another frame) in a P-frame is compensated with the accumulated 
errors and its block counter is more than 0, blockjcounter decreases 1. If an inter 
block in a P-frame is not compensated with the accumulated errors, blockjcounter 
increases 1. 

It will be appreciated that if the compensate switch 348 is opened, the 
transcoder 300 and the transcoding process is greatly simplified over the cascaded 
transcoder 200 shown in Fig. 2. For example, when the compensate switch 348 is 
open, the DCT module 342 and summer module 318 can be removed. By 
opening the compensate switch 348, the quality of video may be reduced because 
drift error is not being added back into the video data; however, if the CPU does 
not have enough cycles to perform the transcoding with drift error compensation, 
opening the compensate switch 348 may ensure that the video data continues to be 
transcoded, albeit at a slightly lower quality. 

With regard to the skip B-frame switch(es) 344, the skip B-frame switch(es) 
344 applies only to B-frames. In one implementation, if the TH value is set to a 
very high value and the server still is unable to handle real-time transcoding, the 
skip B-frame switch 344 will be opened. When the skip B-frame switch(es) are 
open, any B-frames in the input video data 302 will not be transcoded or 
transmitted. In one implementation, instead of transmitting a transcoded B- 
frame, the transcoder 300 will add some header information that notifies the 
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decoder that the B-jframe has been skipped. The VLD 306 can determine whether 
a frame is a B-frame and insert the information into the headers 308. 

If the accumulate error switch 346 is opened, the transcoder 300 is an open- 
loop system. Thus, the accumulate error switch 346 can be opened to prevent 
accumulation and compensation of drift error, and thereby reduce required 
processing time during the transcoding process. In some extreme cases where the 
server still doesn't have enough cycles (even with switches 344 and 348 opened) 
for the transcoding process, the accumulate error switch 346 can be opened. When 
the accumulate error switch 346 is opened the drifting error for the current group 
of pictures (GOP) will not be accumulated. The transcoding process runs in open- 
loop mode where the entire motion compensation loop is eliminated. In the open- 
loop mode, the skip B-frame switch 344 can still be turned on to save extra 
computational cycles. 

In one implementation, the skip B-frame switch(es) 344, the accumulate 
error switch 346, and the compensate switch 348 are all closed when P-frames 
from the input video data 302 are being transcoded. In this implementation, the 
accumulated drift error 316 will continue to be accumulated and used to 
compensate for drift due to transcoding. 

It is to be understood that the term 'switch' as used herein is intended in the 
broadest meaning of the term. The term 'switch' is not to be limited to well- 
known mechanical or electromechanical switches. On the contrary, because the 
transcoder 300 can be readily implemented in any combination of hardware, 
software, or firmware, the term 'switch' includes any type of hardware switch, 
software switch, or firmware switch that causes an output state corresponding to an 
associated input condition. 
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By way of example, and not limitation, the switches described herein may 
be implemented with a software instruction, such as an 'if-then' statement, or a 
'switch' statement, in the well-known programming language. As another 
example, any of the switches described herein may be implemented with a solid 
state device, such as a transistor, which allows for selection among multiple states 
in response to an input state. As yet another example, switches may be 
implemented with programmable memory devices, such as Programmable Read 
Only Memory (PROM). Those skilled in the art will recognize numerous other 
implementations of switches described herein that fall within the scope of the 
claims recited below. 

Fig. 4 is a block diagram of another exemplary video transcoder 400 that 
can be used for encoding I-frames of input video data 302 encoded at an initial bit 
rate into I-frames of transcoded video data 304 encoded at a target bit rate. Not 
all modules included in the transcoder 300 (Fig. 3) are necessary for I and B 
frames. Fig. 4 depicts a transcoder 400 for transrating the input video data 302 
for I-frames. Below, Fig. 5 illustrates a variation of the transcoder 300 (Fig. 3) 
that can be used for transrating B-frames. 

The transcoder 400 can be easily obtained using the transcoder 300 (Fig. 3) 
by opening the compensate switch 348 of transcoder 300. Since both I-frames 
and P-frames are used by the decoder at the client as references for other frames, it 
is necessary to update the error buffer 338 in Fig. 4 with the accumulated drift 
errors. However, because I-frames are coded without any prediction from other 
frames, no drift error compensation is needed for the I-frames. Thus, as shown in 
Fig. 4, the accumulate error switch 346 is closed, thereby causing the transcoder 
400 to accumulate drift error in the error buffer 338; however, the compensate 
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switch 348 (Fig. 3) is opened, thereby avoiding the additional complexity 
associated with compensating for drift error. 

The reader will appreciate that the transcoder 400 is a scaled version of the 
transcoder 300 by virtue of the compensate switch 348. The complexity of the 
transcoder 300 can be dynamically scaled in response to detected system 
conditions to achieve numerous desired coding objectives. Fig. 5 illustrates how 
a user may even fiirther scale down the complexity of the transcoder 300, and yet 
still provide for adjustment of bit rate in response to system conditions. 

Without an error accumulation and compensation feedback loop (i.e., in 
open-loop mode), adjusting the bit rate of the encoded data can result in reduction 
of video quality due to drifting error. For example, a simple open-loop approach 
is to re-quantize DCT coefficients in the original video bit stream to achieve a 
desired lower bit rate without consideration of the P-frames; however, any changes 
in the P-frame not only would result in errors in the associated decoded frame but 
also would cause errors in subsequent frames, and may even accumulate into larger 
and larger errors in later frames. Such an accumulation of errors is often referred 
to as a "drift error" problem, which has been discussed herein. 

In some cases, a limited amount of drift error may be acceptable. For 
example, when the server CPU is at or near maximum utilization and can not keep 
up with the necessary speed of transcoding, error accumulation and/or 
compensation may impose too great of a burden on the CPU and result in a large 
drop in quality. Allowing for a limited, or acceptable, amount of drift error may 
be beneficial when transcoding B-frames because B-frames are not used as a 
reference image during reconstruction. Thus, any drifting error associated with a 
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B-frame is not propagated to other frames. Fig. 5 illustrates such an open-loop 
re-quantization model. 

Fig. 5 is a block diagram of another exemplary video transcoder 500 that 
can be used for encoding B-frames of input video data 302 encoded at an initial bit 
rate and generate transcoded video data 304 encoded at a target bit rate. The 
open-loop transcoder 500 is a greatly scaled down version of the transcoder 300 
(Fig. 3). As is known, B-frames don't contribute to drift errors because B-frames 
are not relied on upon as reference frames by the decoder for prediction. Thus, 
the transcoder 500 has been scaled down by opening the compensate switch 348 
and the accumulate switch 346 of the transcoder 300. 

Exemplary systems for re-quantizing video data 

Fig. 6 is a block diagram of a re-quantization module 600 that employs a 
look-up table for rapid re-quantization of input video data. In Fig. 6, input level 
602 (represented by Xj) represents the decoded quantization level that is extracted 
from the initial video bit stream by variable length decoding (VLD); a DCT 
coefficient 604 (represented by X2) is obtained by a de-quantization process 606 
with the initial quantization parameter q\\ the output quantization (i.e., re- 
quantization) level 608 (represented hyXs) is calculated by a second quantization 
process 610 with a target quantization parameter ^2- The above computations are 
formulated in equations (2) and (3) below: 

X,=Q-\X,,q^,,q,) (2) 
X,=Q{X,,q„^,q,) (3) 
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Q( ) is the quantization function, Q'^( ) is the de-quantization function, and 
qm is the corresponding element in the MPEG-2 quantization matrix. An exemplary 
default intra block quantization matrix 700 and inter block quantization matrix 702 
are illustrated in Fig. 7. 

Incorporating equation (2) with equation (3), can be calculated with the 
following equation: 

X, = Q{X,,q„^,q^)^Q{Q-\X,,q^,qM.^^^^ (4) 

Fj() is the de-quantization and quantization process. 

For accumiilating the re-quantization error, a second de-quantization DCT 
coefficient 612 (represented as X4) is determined by a second dequantization 
module 614 according to equation (5): 

X,=Q-\X,,q^,q,) (5) 

Incorporating equations (4) and (5) results in: 

=0-^(^1 (Xp^„,^p^2X^m'^2) = ^2(^1.^.1. ^P^2) (6) 

) represents the first de-quantization process 606, the quantization 
process 610 and second de-quantization process 614. A re-quantization error 616 
(represented asXj) is calculated as follows: 

^5=^2-^4=e"*(^P^«.^l)-^2(-'^p9«.^P^2) = ^3(^p9«|.^p92) (7) 
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Traditional approaches involve calculating the output quantization level X2 
and the re-quantization error Xs during the transcoding or transrating process. It 
will be appreciated that calculating the values and Xs can be time-consuming 
and resource-intensive tasks for a processor. In an alternative implementation, 
the functions Fj( ) and Fsf ) can be simply implemented using a look-up table 
(LUT) 618 to determine X3 and X^ The LUT 618 takes input parameters Xu qm^ 
quQi and outputs scaled versions ofXs and Jfj. These scaled versions ofXj and JG 
are referred to here as X[ and X[ , respectively. 

A particular implementation of the LUT 618 includes an intra re- 
quantization table 620 to re-quantize intra blocks and an inter re-quantization table 
622 to re-quantize inter blocks. Thus, the intra re-quantization table 620 outputs 
values Xs and X5 that correspond to X\ levels in intra blocks; and the inter re- 
quantization table 622 outputs values X3 and Xs that correspond to Xx levels in inter 
blocks. 

With regard to Fig. 7, for intra blocks, can be selected from a matrix, 
such as the exemplary default intra block quantization matrix 700. In the 
illustration, the default intra block quantization matrix 700 includes a set of values 
{16, 19, 22, 24, 26, 27, 29, 32, 34, 35, 37, 38, 40, 46, 48, 56, 58, 69, 83}. Thus, 
qra can take on one of 19 possible values. Assume that the quantization parameters 
qx and qi each vary from 1 to 31. However, because of bit-rate reduction 
transcoding, it is reasonable to assume 5^2-^15 so there are 31x16 possible 
combinations of qx and ^2- 

In a particular implementation of the re-quantization module 600, the input 
decoded level Xx is in the range [1, 2047]. Considering all possible combinations 
of Xi, q^, qx and qi, the intra re-quantization table 620 consists of 3 1 x 16x 19x2047 
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elements. Each element has 4 bytes with 2 byte for Xs and 2 bytes for X5. 
Therefore, the intra re-quantization table 620 would require about 77M bytes. For 
inter blocks, the exemplary default inter quantization matrix 702 includes only 
value of 16, from which q„^ is selected. So the inter re-quantization table 622 
requires about 4M bytes. 

The sizes of the intra re-quantization table 620 and inter re-quantization 
table 622 can be reduced significantly. While it is possible that Xi 602 can range 
from [1, 2047], Xj 602 is frequently located in a certain range, e.g., [1, 40]. When 
Xi 602 is more than a predetermined switching value, the re-quantization process 
is not implemented by look-up table 618 so as to reduce the table sizes. The 
predetermined switching value may be chosen depending on the particular 
implementation. In one implementation, the predetermined switching value is set 
to 40. 

In MPEG-2, the quantization niatrix can be defined by a user and may differ 
from the default matrices 700 and 702 shown in Fig. 7. In the worst case, may 
have 64 different values. The table size will be far more than the previous 
estimation. Considering the general case, when the two matrices are constructed, 
q^n in equations (4) and (7) is always set to 16. In a particular implementation, the 
re-quantization value X^ and the error value Xs can be calculated with the 
parameter, q„^, with a multiply and a right shift (i.e., a divide operation) as follows: 



X[^q 
16 



m 



(8) 



16 



m 



(9) 
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X'^ represents the results of Fi() with fixed q^i = 16, and X'^ represents the 
resuhs of F^O with fixed = 16. As discussed above, and X'^ can be 
obtained by the look-up table 618. The multiply and shift operations of equations 
(8) and (9) can be implemented with weighting modules 624. By the above two 
improvements, the size of intra and inter matrix has been shown to be around 79K 
bytes. Furthermore, the intra quantization table 620 and the inter quantization table 
622 do not need to be recalculated when the quantization matrices 700 and 702 are 
changed. 

Referring again to Fig. 6, the re-quantizer 600 includes a switch 626 that is 
controlled by input level value 602. In the implementation shown in Fig. 6, when 
input level 602 is more than 40, the re-quantization is performed without use of the 
look-up table 618; otherwise the look-up table 618 is used. As discussed above, 
the input level at which the switch 626 toggles to and from the look-up table 618 
path can be adjustable. 

Fig. 8 is a block diagram of a target bit rate controller which may be used to 
dynamically control a quantization parameter corresponding to the target bit rate of 
transcoded video data. In order to support more accurate transcoding rate and 
modify the transcoding rate on the fly, a new rate control scheme is proposed in 
Fig. 8, where rate information and quantization parameters in input MPEG-2 bit 
stream are fiilly utilized. When a MPEG-2 video element stream (VES) is input 
into the transcoder, the initial average bit rate R is already known. Obviously, the 
target transcoding bit rate R ' is also known. 

A calculation module 802 calculates the target number of bits B' and the 
quantization parameter Q' of each transcoded frame. In the implementation 
shown in Fig. 8, it is assumed that the bits of each frame in the input MPEG-2 
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VES are allocated at the constraint of a long virtual buffer just as in the traditional 
rate control techniques. It is also assumed that there is no buffer over-flow or 
buffer under-flow. Therefore, the target number of bits B ' can be calculated by a 
linear model as follows: 

F^aB. (10) 
where a^R'IR 

B represents the number of bits of the current frame in the input stream. The 
multiplier, a , is the ratio of the target transcoding bit rate R ' and the input 
average bit rate R. The quantization parameter Q' can be calculated by the 
following equation (11): 

Q = f{B\B,Q). (11) 

Here Q is the average quantization parameter of the current frame in the 
input MPEG-2 VES. The relationship between coded bits and quantization 
parameter may have many different representations. When they are described as a 
reciprocal relationship, the formula (11) can be wiritten as 



2'=^ (12) 



If the quantization parameter is allowed to change in each macroblock, Q ' 
is used as a preliminary, or temporary, quantization parameter. The actual target 
quantization parameter may be further refined by a control module 804. The 
control module 804 allocates the target bits B' to each macroblock with a frame 
virtual buffer. The size of buffer is equal to the target bits B\ A difference 
module 706 determines the difference between the actual bits J5" and the target 
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bits B\ The difference is fed back into the calculating module 802 in the next 
frame. 

Another implementation of the calculating module 802 uses another 
approach. When the number of target bits 5' in a certain frame is calculated, it is 
useful to consider the differences between the number of target bits B' and the 
actual number of bits B'' m previous frames. During operation, 5' is calculated 
using equation (10) only on the first frame of each group of pictures (GOP) or the 
beginning of a sequence. For subsequent frames, the calculating module 802 
applies a different algorithm. 

Assume there are N frames to be transcoded. The following formula is used 
to calculate the ratio 




Here 5"(/) and B{i) are the number of bits for frame / in the transcoded 
bit stream and the input bit stream (i.e., input video data), respectively. The target 
number of bits B' (k) for frame k can be represented as follows: 

B(k) = axB{k) (14) 

Moreover, for different frame type, we can use a weight factor to modify 
the target bit number to get better frame quality, and then we can re-write the 
above equation as 

B(k) = wxaxB{k) (15) 

The value w is the weighting factor for I, P, or B frame, respectively. 

The control module 804 implements a macroblock level control scheme. 
The macroblock level control scheme ensures that the target number of bits B ' for 
each picture is met, and optimizes the quantization parameter for each macroblock 
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based on R-D relationship which would produce a minimum average distortion 
with rate constraint. 

For jth macroblock, the distortion dj and the bit number r^(^y) can be 
represents as follows. 

(16) 

o o 

Hj Hj 

Here, and are bit number and quantization parameter for the same 
macroblock in the input bit stream, andjc^ is the complexity of this macroblock. 
The complexity, Xj , can be estimated by multiplying rj and . The 
quantization parameters in the transcoded bit stream can be calculated with the 
following optimization 



N 

mm 



Z^yfey). (18) 



subject to f;ofc)=5» (19) 



y=i 



Here, 5 ' is the target number of bits for this frame. 

A Lagrangian method can be used to solve the above optimization problem. 



>, 12 



(20) 



By solving the above extremum, the quantization parameter for each 
macroblock can be represented as 
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N , 

gj = ^ , n = 1,2,..., N . 



(21) 



The macroblock level rate control scheme implemented by the control 
module 804 is described as follows. 

y-i 

For macroblock j, the left target bit number is A5 = B'-^r„(q„) , where the 

«=i 

sum item is the coded bits of previous j-1 macroblocks. 

If A5<0, qj =Qayg/j\ i-^'^ we do not calculate the quantization parameter 

again. Instead, the average value of previous j-1 macroblocks is used; otherwise 

The difference of quantization parameters calculated in frame level and 
macroblock level is equal to Aqj = qj -Q . If Ag^ < -2 , then Ag^. = -2 , else if 
/Haqj > 2 , then tiqj = 2 . The final quantization parameter for macroblock j is 

Q+Aqj. 

Exemplary operation for encoding video data at a target bit rate 

Fig. 9 illustrates a transrating operation flow or algorithm 900 including 
exemplary operations for transcoding input video data encoded at an initial bit rate 
into output video data encoded at a target bit rate. The transrating operation flow 
900 does not change the underlying format of the video data. For purposes of 
illustration, but not limitation, it is assumed that the video data is encoded 
according to the MPEG-2 format. 
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A receiving operation 902 receives the initial video data encoded at an 
initial quantization level corresponding to an initial bit rate. Typically, the 
receiving operation 902 reads the initial video data from one or more files stored in 
memory from which the video data can be made available to clients, applications, 
or other users. As discussed above, the initial bit rate and initial quantization 
level of the initial video data is easily determined. 

A dequantizing operation 904 dequantizes the initial video data using a 
quantization parameter corresponding to the initial bit rate. The output of the 
dequantizing operation 904 includes one or more discrete cosine transform (DCT) 
coefficients. The DCT coefficients are compression coefficients that represent 
the initial video data in the frequency domain. 

A determining operation 906 determines transcoding error, or drift error, 
corresponding to the difference between initial video data and the transrated video 
data in the frequency domain. Due to the linearity of the DCT, DCT coefficients 
of the transrated video data can simply be subtracted from the DCT coefficients of 
the initial video data. 

An accumulating operation 908 accumulates the determined transcoding 
error. The accumulating operation 908 adds the determined transcoding error 
from one frame with transcoding error(s) from previously transrated video frame in 
order to account for the drift error associated with reducing the bit rate of the video 
data. In one implementation of the accumulating operation 908, drift error is 
accumulated in the pixel domain, rather than the frequency domain, and stored in 
an error buffer. 

A compensating operation 910 combines the initial video data with the 
accumulated drift error to compensate the video data with the drift error. The 
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compensating operation converts the accumulated drift error from the pixel domain 
to the frequency domain and then sums the accumulated drift error (in the 
frequency domain) with the dequantized DCT coefficients from the dequantizing 
operation 904. The output of the compensating operation 910 includes error- 
compensated compression coefficients. 

A re-quantizing operation 912 re-quantizes the error-compensated 
compression coefficients using a target quantization parameter corresponding to 
the target bit rate. The target quantization parameter can be dynamically controlled 
on a frame-by-frame, or a macroblock-by-macroblock basis. The output data 
from the re-quantizing operation 912 are transrated compression coefficients that 
can be used to determine the drift error in the determining operation 906. 

Fig. 10 illustrates an adapting operation flow or algorithm 1000 including 
exemplary operations for scaling the complexity of a transcoder in response to 
system conditions and/or video frames. The algorithm 1000 can be implemented 
by the servers and transcoders discussed herein, as well as any general purpose 
computer. As with the description of Fig. 9, for illustration purposes, it is 
assumed that the video data that is transcoded is in the MPEG-2 format. 

A receiving operation 1002 receives MPEG-2 video data encoded at an 
initial bit rate. A determining operation 1004 determines whether the video data 
is B-frame video data. The determining operation 1004 may be implemented 
with a variable length encoder (VLD), such as the VLD described above, which 
can extract frame-type information from video header data. If the determining 
operation 1004 determines that the video data is B-frame video, the adapting 
operation 1000 branches 'NO' to another determining operation 1006. The 
determining operation 1006 determines whether the central processing unit (CPU) 
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is being utilized to its maximum capacity and hence cannot transcode the video 
data in real-time. One implementation of the deteraiining operation 1006 
determines whether the CPU has enough clock cycles available to transcode the 
video data in real-time. 

If the determining operation 1006 determines that the CPU utilization is 
maximized, the adapting operation 1000 branches 'YES' to a replacing operation. 
The replacing operation 1008 replaces the B-frame data with only header data that 
indicates to the decoder that the B-frame has been skipped. The replacing 
operation 1008 then only transmits the header data and not the B-frame video data. 
After the replacing operation, the adapting operation returns to the receiving 
operation without transcoding the B-frame data. 

Referring to the determining operation 1004, if it is determined that the 
video data is not B-frame data, the adapting operation branches 'NO' to an 
accumulating operation 1010. The accumulating operation 1010 generates and 
accumulates errors due to transrating. One implementation of the accumulating 
operation 1010 subtracts transrated video coefficients from the received video 
coefficients and adds the difference to accumulated error from previously 
transrated video data. 

After the accumulating operation 1010, another determining operation 1012 
determines whether the video data is I-frame video data. If it is determined that 
the video data is I-frame video data, the adapting operation branches 'YES' to a 
transcoding operation 1014, without compensating the video data with the 
accumulated transrating error. 

Likewise, with regard to the determining operation 1006, if the determining 
operation 1006 determines that the CPU does have sufficient capacity to transcode 
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in real-time, the adapting operation 1000 branches 'NO' to the transcoding 
operation 1014, which transcodes (e.g., re-quantizes, variable length encodes) the 
B-frame data without compensating with the accumulated error. 

Referring again to the determining operation 1012, if it is determined that 
the video data is not I-frame data, then the video data is P-frame data, and the 
adapting operation 1000 branches 'NO' to another determining operation 1016. 
The determining operation 1016 determines whether the accumulated error due to 
transrating is greater than a predetermined threshold. If the accumulated error is 
not greater than a predetermined threshold, the adapting operation branches 'NO' 
to the transcoding operation, which transcodes the P-frame data at quantization 
levels corresponding to the target bit rate. 

If the determining operation 1016 is determines that the accumulated error 
is greater than the predetermined threshold, the adapting operation branches 'YES' 
to a compensating operation 1018. The compensating operation 1018 
compensates the input video data with the accumulated error in order to limit drift 
error. After the video data has been compensated, the transcoding operation 1014 
re-quantizes and variable length encodes the error compensated video data at a 
quantization level corresponding to the target bit rate. The adapting operation 
1000 may then return to the receiving operation 1002 to continue the transcoding 
process, if more video data is to be transcoded. 

Various modules and techniques may be described herein in the general 
context of computer-executable instructions, such as program modules, executed 
by one or more computers or other devices. Generally, program modules include 
routines, programs, objects, components, data structures, etc. that perform 
particular tasks or implement particular abstract data types. Typically, the 
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functionality of the program modules may be combined or distributed as desired in 
various embodiments. 

An implementation of these modules and techniques may be stored on or 
transmitted across some form of computer-readable media. Computer-readable 
media can be any available media that can be accessed by a computer. By way of 
example, and not limitation, computer-readable media may comprise "computer 
storage media" and "communications media." 

"Computer storage media" includes volatile and non-volatile, removable 
and non-removable media implemented in any method or technology for storage of 
information such as computer-readable instructions, data structures, program 
modules, or other data. Computer storage media includes, but is not limited to, 
RAM, ROM, EEPROM, flash memory or other memory technology, CD-ROM, 
digital versatile disks (DVD) or other optical storage, magnetic cassettes, magnetic 
tape, magnetic disk storage or other magnetic storage devices, or any other 
medium which can be used to store the desired information and which can be 
accessed by a computer. 

"Communication media" typically embodies computer-readable 
instructions, data structures, program modules, or other data in a modulated data 
signal, such as carrier wave or other transport mechanism. Communication media 
also includes any information delivery media. The term "modulated data signal" 
means a signal that has one or more of its characteristics set or changed in such a 
manner as to encode information in the signal. By way of example, and not 
limitation, communication media includes wired media such as a wired network or 
direct-wired cormection, and wireless media such as acoustic, RF, infrared, and 
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Other wireless media. Combinations of any of the above are also included within 
the scope of computer-readable media. 

Although some exemplary methods, devices and exemplary systems have 
been illustrated in the accompanying Drawings and described in the foregoing 
Detailed Description, it will be understood that the methods and systems are not 
limited to the exemplary embodiments disclosed, but are capable of numerous 
rearrangements, modifications and substitutions without departing from the spirit 
set forth and defined by the following claims. 
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